Skip to content

Conversation

@Critsium-xy
Copy link
Collaborator

@Critsium-xy Critsium-xy commented Jan 13, 2025

Added vector_mul_vector, vector_div_vector and vector_add_vector in blas_connector.h

vector_mul_vector is used to multiply each of the vector elements, and vector_div_vector is used to divide.

vector_add_vector is used to caculate y = const1 * a + const2 * b

CUDA implement is also completed.

@Critsium-xy Critsium-xy marked this pull request as ready for review January 13, 2025 17:58
@Critsium-xy Critsium-xy marked this pull request as draft January 14, 2025 05:35
@Critsium-xy Critsium-xy changed the title [Feature] Complete Vector_mul_vector and vector_div_vector in blas_connector [Feature] Add vector_mul_vector, vector_div_vector and vector_add_vector in blas_connector Jan 14, 2025
@Critsium-xy Critsium-xy marked this pull request as ready for review January 14, 2025 05:40
@Critsium-xy Critsium-xy marked this pull request as draft January 14, 2025 16:59
@Critsium-xy Critsium-xy changed the title [Feature] Add vector_mul_vector, vector_div_vector and vector_add_vector in blas_connector [Feature] Add vector_mul_vector, vector_div_vector and vector_add_vector in blas_connector and added some GPU tests. Jan 15, 2025
@Critsium-xy Critsium-xy marked this pull request as ready for review January 15, 2025 09:36
@mohanchen mohanchen merged commit 0a0e19a into deepmodeling:develop Jan 15, 2025
14 checks passed
@mohanchen mohanchen added the GPU & DCU & HPC GPU and DCU and HPC related any issues label Jan 15, 2025
@Critsium-xy Critsium-xy deleted the blas_update_3 branch January 16, 2025 01:30
Fisherd99 pushed a commit to Fisherd99/abacus-BSE that referenced this pull request Mar 31, 2025
…tor in blas_connector and added some GPU tests. (deepmodeling#5858)

* Added some other necessary kernels

* Fix compiling bug

* XX

* Finish CUDA kernel

* Fix marcos

* Fix typename

* GPU implementation

* Fix bugs

* add vector_add_vector kernel

* Add blas_connector CPU tests

* Fix blas usgae

* Add initializer and GPU tests
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

GPU & DCU & HPC GPU and DCU and HPC related any issues

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants